Talend Big Data – Machine learning
SubscriptionThis content is available for Talend Academy subscription users.Instructor-ledThis content is available as instructor-led training.
Talend provides a development environment that lets you interact with many source and target big data stores, without having to learn and write complicated code.
This course covers the implementation of machine learning algorithms in Big Data Batch Jobs using the Spark framework.
Duration: 1 day (7 hours)
Target audience: Anyone who wants to use Talend Studio to industrialize machine learning algorithms
Prerequisites: Completion of Talend Data Quality Essentials or Talend Big Data Basics
Learning objectives: After completing this learning plan, you will be able to:
-
Connect to a Hadoop cluster from a Talend Job
-
Use context variables and metadata
-
Read and write files in HDFS in a Big Data Batch Job
-
Configure a Big Data Batch Job to use the Spark framework
-
Create and test recommendation models
-
Create and test classification models
-
Use a machine learning algorithm to deduplicate data
Training modules: To complete the learning plan, take the following training modules: